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AMENDMENTS TO THE CLAIMS: 

This listing of claims will replace all prior versions, and listings, of claims in the 
application: 

LISTING OF CLAIMS 

1. (Previously Presented) Method for recognizing text in a captured imagery, where 
said captured imagery is of a three-dimensional scene, said method comprising the 
steps of: 

(a) detecting a text region in the captured imagery; 

(b) adjusting along three axes said detected text region to produce a rectified 
image; and 

(c) applying optical character recognition (OCR) processing to said rectified 
image to recognize the text in the captured imagery. 

2. (Original) The method of claim 1, wherein said adjusting step (b) comprises the 
step of (b1) computing a base line and a top line for a line of detected text within said 
detected text region. 

3. (Original) The method of claim 2, wherein said base line and said top line 
correlate substantially to horizontal parallel lines of a rectangular bounding box that is 
fitted to said line of detected text. 

4. (Original) The method of claim 2, wherein said base line and said top line are 
estimated by rotating said line of detected text at various angles and then computing a 
plurality of horizontal projections over a plurality of vertical edge projections. 

5. (Original) The method of claim 4, wherein said base line is selected that 
corresponds to a rotation angle that yields a steepest slope on a bottom side of one of 
said plurality of horizontal projections. 
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6. (Original) The method of claim 4, wherein said top line is selected that 
corresponds to a rotation angle that yields a steepest slope on a top side of one of said 
plurality of horizontal projections. 

7. (Original) The method of claim 2, wherein said base line is selected comprising 
the steps of: 

locating a plurality of bottom edge pixels, where each bottom edge pixel is 
located for each column in said rectangular bounding box; 

rotating said plurality of bottom edge pixels through a series of angles around an 
initial estimated text angle for said line of detected text; 

summing horizontally along each row; and 

determining a baseline angle from a maximum sum of squared projections and 
determining a baseline position from a maximum projection. 

8. (Original) The method of claim 2, wherein said top line is selected comprising 
the steps of: 

locating a plurality of top edge pixels, where each top edge pixel is located for 
each column in said rectangular bounding box; 

rotating said plurality of top edge pixels through a series of angles around an 
initial estimated text angle for said line of detected text; 

summing horizontally along each row; and 

determining a top line angle from a maximum sum of squared projections and 
determining a top line position from a maximum projection. 

9. (Original) The method of claim 2, wherein said adjusting step (b) further 
comprises the step of (b2) computing a dominant vertical direction of character strokes 
for a line of detected text within said detected text region. 
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10. (Original) The method of claim 9, wherein said dominant vertical direction 
computing step (b2) comprises the step of computing a plurality of vertical projections 
over a plurality of vertical edge transitions after rotating said line of detected text in a 
plurality of degree increments. 

11. (Original) The method of claim 10, wherein said dominant vertical direction is 
selected that corresponds to an angle where a sum of squares of said vertical 
projections is a maximum. 

12. (Original) The method of claim 1 , further comprising the step of: 

(b1) binarizing said detected text region prior to applying said OCR processing 
step (c). 

13. (Original) The method of claim 12, further comprising the step of: 

(d) applying agglomeration processing subsequent to said OCR processing to 
produce the text in the captured imagery. 

14. (Original) The method of claim 13, further comprising the step of: 

(e) applying lexicon processing subsequent to said agglomeration processing to 
produce the text in the captured imagery. 

15. (Original) The method of claim 14, further comprising the step of: 

(f) applying false text elimination processing subsequent to said lexicon 
processing to produce the text in the captured imagery. 

16. (Previously Presented) Apparatus for recognizing text in a captured imagery, 
where said captured imagery is of a three-dimensional scene, said apparatus 
comprising: 

means for detecting a text region in the captured imagery; 
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means for adjusting along three axes/said detected text region to produce a 
rectified image; and 

means for applying optical character recognition (OCR) processing to said 
rectified image to recognize the text in the captured imagery. 

17. (Original) The apparatus of claim 16, wherein said adjusting means computes a 
base line and a top line for a line of detected text within said detected text region. 

18. (Original) The apparatus of claim 17, wherein said base line and said top line 
correlate substantially to horizontal parallel lines of a rectangular bounding box that is 
fitted to said line of detected text. 

19. (Original) The apparatus of claim 17, wherein said base line and said top line are 
estimated by rotating said line of detected text at various angles and then computing a 
plurality of horizontal projections over a plurality of vertical edge projections. 

20. (Original) The apparatus of claim 19, wherein said base line is selected that 
corresponds to a rotation angle that yields a steepest slope on a bottom side of one of 
said plurality of horizontal projections. 

21. (Original) The apparatus of claim 19, wherein said top line is selected that 
corresponds to a rotation angle that yields a steepest slope on a top side of one of said 
plurality of horizontal projections. 

22. (Original) The apparatus of claim 17, wherein said adjusting means further 
computes a dominant vertical direction of character strokes for a line of detected text 
within said detected text region. 

23. (Original) The apparatus of claim 22, wherein said adjusting means computes 
said dominant vertical direction by computing a plurality of vertical projections over a 
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plurality of vertical edge transitions after rotating said line of detected text in a plurality 
of degree increments. 

24. (Withdrawn) Method for recognizing text in a captured imagery having a plurality 
of frames, said method comprising the steps of: 

(a) detecting a text region in a frame of the captured imagery; 

(b) applying optical character recognition processing (OCR) to said detected text 
region to identify potential text for said frame; and 

(c) agglomerating the OCR identified potential text over a plurality of frames in 
the captured imagery to recognize the text in the detected text region. 

25. (Withdrawn) The method of claim 24, wherein said agglomerating step (c) 
comprises the step of updating an agglomeration structure with said OCR identified 
potential text of a current frame. 

26. (Withdrawn) The method of claim 25, wherein said updating step comprises the 
step of (d) finding correspondence between a text region of said agglomeration 
structure with a text region of said current frame. 

27. (Withdrawn) The method of claim 26, wherein said updating step further 
comprises the step of (c2) finding character-to-character correspondence for each pair 
of overlapping lines between said text region of said agglomeration structure with said 
text region of said current frame to find one or more character group pairs. 

28. (Withdrawn) The method of claim 27, wherein said updating step further 
comprises the step of (c3) updating said one or more character group pairs. 

29. (Withdrawn) The method of claim 28, wherein said updating step further 
comprises the step of (c4) marking text in said agglomeration structure that is not in said 
current frame as a deletion. 
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30. (Withdrawn) The method of claim 29, wherein said updating step further 
comprises the step of (c5) marking text in said current frame that is not in said 
agglomeration structure as an insertion. 

31 . (Withdrawn) The method of claim 25, further comprising the step of: 

(d) outputting said text in the detected text region after each processed frame. 

32. (Withdrawn) The method of claim 25, further comprising the step of: 

(d) outputting said text in the detected text region only when a change is detected 
as to said text of said captured imagery. 

33. (Withdrawn) The method of claim 25, further comprising the step of: 

(d) outputting only said text within said agglomeration structure when said text is 
not detected in a current frame. 

34. (Withdrawn) Apparatus for recognizing text in a captured imagery having a 
plurality of frames, said apparatus comprising: 

means for detecting a text region in a frame of the captured imagery; 

means for applying optical character recognition processing (OCR) to said 
detected text region to identify potential text for said frame; and 

means for agglomerating the OCR identified potential text over a plurality of 
frames in the captured imagery to extract the text in the detected text region. 

35. (Withdrawn) The apparatus of claim 34, wherein said agglomerating means 
updates an agglomeration structure with said OCR identified potential text of a current 
frame. 
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36. (Withdrawn) The apparatus of claim 35, wherein said agglomerating means finds 
correspondence between a text region of said agglomeration structure with a text region 
of said current frame. 

37. (Withdrawn) The apparatus of claim 36, wherein said agglomerating means 
further finds character-to-character correspondence for each pair of overlapping lines 
between said text region of said agglomeration structure with said text region of said 
current frame to find one or more character group pairs. 

38. (Withdrawn) The apparatus of claim 37, wherein said agglomerating means 
further updates said one or more character group pairs. 

39. (Withdrawn) The apparatus of claim 38, wherein said agglomerating means 
further marks text in said agglomeration structure that is not in said current frame as a 
deletion. 

40. (Withdrawn) The apparatus of claim 39, wherein said agglomerating means 
further marks text in said current frame that is not in said agglomeration structure as an 
insertion. 

41 . (Withdrawn) The apparatus of claim 35, further comprising: 

means for outputting said text in the detected text region after each processed 

frame. 

42. (Withdrawn) The apparatus of claim 35, further comprising: 

means for outputting said text in the detected text region only when a change is 
detected as to said text of said captured imagery. 



8 



09/895,868 



43. (Withdrawn) The apparatus of claim 35, further comprising: 

means for outputting only said text within said agglomeration structure when said 
text is not detected in a current frame. 

44. (Previously Presented) Method for recognizing text in a captured imagery, said 
method comprising the steps of: 

(a) detecting a text region in the captured imagery; 

(b) adjusting said detected text region to produce a rectified image; 

(c) applying optical character recognition (OCR) processing to said rectified 
image to recognize the text in the captured imagery; 

wherein said adjusting step (b) comprises the step of (b1) computing a base line 
and a top line for a line of detected text within said detected text region; 

wherein said base line and said top line are estimated by rotating said line of 
detected text at various angles and then computing a plurality of horizontal projections 
over a plurality of vertical edge projections; and 

wherein said base line is selected that corresponds to a rotation angle that yields 
a steepest slope on a bottom side of one of said plurality of horizontal projections. 

45. (Previously Presented) Method for recognizing text in a captured imagery, said 
method comprising the steps of: 

(a) detecting a text region in the captured imagery; 

(b) adjusting said detected text region to produce a rectified image; 

(c) applying optical character recognition (OCR) processing to said rectified 
image to recognize the text in the captured imagery; 

wherein said adjusting step (b) comprises the step of (b1 ) computing a base line 
and a top line for a line of detected text within said detected text region; 

wherein said base line and said top line are estimated by rotating said line of 
detected text at various angles and then computing a plurality of horizontal projections 
over a plurality of vertical edge projections; and 
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wherein said top line is selected that corresponds to a rotation angle that yields a 
steepest slope on a top side of one of said plurality of horizontal projections. 

46. (Previously Presented) Method for recognizing text in a captured imagery, said 
method comprising the steps of: 

(a) detecting a text region in the captured imagery; 

(b) adjusting said detected text region to produce a rectified image; 

(c) applying optical character recognition (OCR) processing to said rectified 
image to recognize the text in the captured imagery; 

wherein said adjusting step (b) comprises the step of (b1) computing a base line 
and a top line for a line of detected text within said detected text region; 

said adjusting step (b) further comprises the step of (b2) computing a dominant 
vertical direction of character strokes for a line of detected text within said detected text 
region; and 

wherein said dominant vertical direction computing step (b2) comprises the step 
of computing a plurality of vertical projections over a plurality of vertical edge transitions 
after rotating said line of detected text in a plurality of degree increments. 

47. (Previously Presented) The method of claim 46, wherein said dominant vertical 
direction is selected that corresponds to an angle where a sum of squares of said 
vertical projections is a maximum. 

48. (Previously Presented) Apparatus for recognizing text in a captured imagery, 
said apparatus comprising: 

means for detecting a text region in the captured imagery; 

means for adjusting said detected text region to produce a rectified image; and 

means for applying optical character recognition (OCR) processing to said 
rectified image to recognize the text in the captured imagery; 

wherein said adjusting means computes a base line and a top line for a line of 
detected text within said detected text region; 
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wherein said base line and said top line are estimated by rotating said line of 
detected text at various angles and then computing a plurality of horizontal projections 
over a plurality of vertical edge projections; and 

wherein said base line is selected that corresponds to a rotation angle that yields 
a steepest slope on a bottom side of one of said plurality of horizontal projections. 

49. (Previously Presented) Apparatus for recognizing text in a captured imagery, 
said apparatus comprising: 

means for detecting a text region in the captured imagery; 

means for adjusting said detected text region to produce a rectified image; and 

means for applying optical character recognition (OCR) processing to said 
rectified image to recognize the text in the captured imagery; 

wherein said adjusting means computes a base line and a top line for a line of 
detected text within said detected text region; 

wherein said base line and said top line are estimated by rotating said line of 
detected text at various angles and then computing a plurality of horizontal projections 
over a plurality of vertical edge projections; and 

wherein said top line is selected that corresponds to a rotation angle that yields a 
steepest slope on a top side of one of said plurality of horizontal projections. 

50. (Previously Presented) Apparatus for recognizing text in a captured imagery, 
said apparatus comprising: 

means for detecting a text region in the captured imagery; 

means for adjusting said detected text region to produce a rectified image; and 

means for applying optical character recognition (OCR) processing to said 
rectified image to recognize the text in the captured imagery; 

wherein said adjusting means computes a base line and a top line for a line of 
detected text within said detected text region; 

wherein said adjusting means further computes a dominant vertical direction of 
character strokes for a line of detected text within said detected text region; and 
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wherein said adjusting means computes said dominant vertical direction by 
computing a plurality of vertical projections over a plurality of vertical edge transitions 
after rotating said line of detected text in a plurality of degree increments. 
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